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FIELD OF THE INVENTION 

The invention relates to a method of embedding a watennark in an information 
signal which is compressed so as to include first signal samples having a given first value and 
further Slg nal samples having a different value. A typical example of such a compressed 
nrformation signal is an MPEG2 video signal in which video images are represented by 
transform coefficients, a significant number of which have the first value : 



i zero. 



BACKGROUND OF THE INVENTION 

A known method of embedding a watermark in a compressed video signal is 
chsclosed in F. Hartung and B. Girod: "Digital Watermarking of MPEG-2 Coded Video in the 
Brtstream Domain", published in ICASSP, Vol. 4, 1997, pp. 2621-2624. The watermark is a 
pseudo-noise sequence in the original, signal domain. The watennark is discrete cosine 
transformed prior to embedding. Non-zero DCT coefficients of the compressed signal are 
modified by adding thereto the conesponding coefficients of the transformed watermark 
15 sequence. 

The prior art watermark embedding scheme has some drawbacks When 
applied to motion-compensated coding, such as MPEG2, the modification of transform 
coefficients may propagate in time. Watermarks from previous frames may accumulate in the 
current frame and result in visual distortion. To avoid this, the prior art watermark embedder 
requires dnft compensation. Moreover, modification of DCT coefficients in an already 
compressed bit stream affects the bit rate. The prior art embedder therefore checks whether 
transmission of the watermarked coefficient increases the bit rate, and transmits the original 
coefficient if that is the case. 

25 OBJECT AND SUMMARY OF THE INVENTION 

It is an object of the invention to provide a method of embedding a watermark 
which alleviates the above-mentioned drawbacks. 

To this end, the method in accordance with the invention is characterized in 
that the modifying step is applied to signal samples if the modified signal sample assumes the 
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first value due to said modification. It is thereby achieved that the number of signal samples 
having the first value increases, which generally leads to a lower bit rate. It is not necessary 
to actually test the impact of a sample modification on the number of bits. 

Preferably, the signal samples qualified for modification are samples having 
5 the smallest zon-zero value (i.e. MPEG video coefficients being quantized as +1 or -1). As 
these coefficients represent noise-like information and the changes are very small 
(± quantization step), drift compensation is not necessary, and the embedded watermark is 
imperceptible but still detectable. 



1 0 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows schematically an arrangement for carrying out the method in 
accordance with the invention. 

Figs. 2A-2C and 3A-3G show diagrams to illustrate the operation of the 
arrangement which is shown in Fig. 1 . 

15 

DESCRIPTION OF A PREFERRED EMBODIMENT 

The invention will now be described with reference to an arrangement for 
embedding a watermark in a video signal which is compressed in accordance with the 
MPEG2 standard, although the invention is neither restricted to video signals nor to a 

20 particular compression standard. Note that the compressed signal may already have an 

embedded watermark. In that case, an additional watermark is embedded in the signal. This 
process of watermarking an already watermarked signal is usually referred to as "remarking". 

Fig. 1 shows a schematic diagram of an arrangement carrying out the method 
in accordance with the invention. The arrangement comprises a parsing unit 110, a VLC 

25 processing unit 120, an output stage 130, and a watermark buffer 140. Its operation will be 
described with reference to Figs. 2A-2C and 3 A-3G. 

The arrangement receives an MPEG elementary video stream MPin which 
represents a sequence of video images. One such video image is shown in Fig. 2A by way of 
illustrative example. The video images are divided into blocks of 8x8 pixels, one of which is 

30 denoted 201 in Fig. 2A. The pixel blocks are represented by respective blocks of 8x8 DCT 
(discrete cosine transform) coefficients. The upper left transform coefficient of such a DCT 
block represents the average luminance of the corresponding pixel block and is commonly 
referred to as the DC coefficient. The other coefficients represent spatial frequencies and are 
referred to as AC coefficients. The upper left AC coefficients represent coarse details of the 
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image, the lower right coefficients represent fine details. The AC coefficients have been 
quantized. This quantization process causes many AC coefficients of a DCT block to assume 
the value zero. Fig. 3A shows a typical example of a DCT block 300, corresponding to the 
pixel block 201 in Fig. 2A. 

The coefficients of the DCT block have been sequentially scanned in 
accordance with a zigzag pattern (301 in Fig. 3A) and variable-length encoded. The variable- 
length encoding scheme is a combination of Huffman coding and run-length coding More 
parucularly, each run of zero AC coefficients and a subsequent non-zero AC coefficient 
constitutes a run-level pair which is encoded into a single variable-length code word Fig 3B 
shows the run-level pairs of the DCT block 300. An End-Of-Block code (BOB) denotes the 
absence of further non-zero coefficients in the DCT block. Fig. 3C shows the series of 
variable-length code words representing DCT block 300 as received by the arrangement, 

In an MPEG2 elementary video stream, four such DCT luminance blocks and 
two DCT chrominance blocks constitute a macro block, a number of macro blocks constitutes 
a slice, a number of slices constitutes apicture (field or frame), and a series of pictures 
constitutes a video sequence. Some pictures are autonomously encoded (I-pictures) other 
Pictures are predictively encoded with motion compensation (P- and B-pictures). In'the latter 
case, the DCT coefficients represent differences between pixels of the current picture and 
pixels of a reference picture rather than the pixels themselves. 

The MPEG2 elementary video stream MPin is applied to the parsing unit 110 
(Fig. 1). This parsing unit partially interprets the MPEG bit stream and splits the stream into 
vanable-length code words representing luminance DCT coefficients (hereinafter- VLCs) and 
other MPEG codes. The unit also gathers information such as the coordinates of the blocks 
the coding type (field or frame), the scan type (zigzag or alternate). The VLCs and associated 
information are applied to the VLC processing unit 120. The other MPEG codes are directly 
applied to the output stage 130. 

The watermark to be embedded is a pseudo-random noise sequence in the 
spatial domain. In this embodiment of the arrangement, a 128x128 basic watermark pattern is 
"tiled" over the extent of the image. This operation is illustrated in Fig. 2B The 128x128 
basic pseudo-random watermark pattern is herein represented by a symbol W for better 
visualization. 

The spatial pixel values of the basic watermark are transformed to the same 
representation as the video content in the MPEG stream. To this end, the 128x128 basic 
watermark pattern is divided into 8x8 blocks, one of which is denoted 202 in Fig 2B The 
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blocks are discrete cosine transformed and quantized. Note that the transform and quantizing 
operation needs to be done only once. The DCT coefficients thus calculated are stored m the 
128x128 watermark buffer 140 of the arrangement. 

The watermark buffer 140 is connected to the VLC processing unit 120, in 
which the actual embedding of the watermark takes place. The VLC processing unit decodes 
(121) selected variable-length code words representing the video image into run-level parrs, 
and converts (122) the series of run-level pairs into a two-dimensional array of 8x8 DCT 
coefficients. The watermark is embedded, in a modification stage 123, by adding to each 
video DCT block the spatially corresponding watermark DCT block. The DCT block 
representing watermark block 202 in Fig. 2B is thus added to the DCT block representing 
image block 201 in Fig. 2A. However, in accordance with a preferred embodiment of the 
invention, only DCT coefficients that are turned into zero coefficients by this operation are 
selected for the purpose of watermarking. For example, the AC coefficient having the value 2 
in Fig. 3A will be modified only if the corresponding watermark coefficient has the value -2. 

In mathematical notation: 

ifc in (ij) + w(ij) = 0 

then c 0 ut(i j) = 0 

else c ou t(i j) = Ci„(ij) 
where c in is a coefficient of a video DCT block, w is a coefficient of the spatially 
corresponding watermark DCT block, and c out is a coefficient of the watermarked video DCT 

It will be appreciated that the number of zero coefficients in the DCT block is 
increased by this operation, so that the watermarked video DCT block can be more efficiently 
encoded than the original DCT block. This is particularly the case for MPEG compressed 
signals because the new zero coefficient will be included in the run of another run-level pair 
(run merge). The re-encoding is performed by a variable-length encoder 124 (Fig. 1). The 
watermarked block is applied to the output stage 130, which regenerates the MPEG stream 
by copying the MPEG codes provided by the parsing unit 110 and inserting regenerated 
VLCs provided by the VLC processing unit 120. Furthermore, the output stage 130 may 
) insert stuffing bits to make the output bit rate equal to the original video bit rate. 

In an advantageous embodiment of the invention, only the signs of the DCT 
coefficients of the watermark pattern are stored in the watermark buffer 140, so that the 
buffer stores +1 and -1 values only. This reduces the memory capacity of the buffer to 1 bit 
per coefficient (128x128 bits in total). Moreover, experiments have shown that it is sufficient 




15 



20 



25 



30 



* 

WO 02/060182 

PCT/IBOl/02708 

to apply watermark embedding to the most significant DCT coefficients only (the most 
significant coefficients are the ones occurring first in the zigzag scan). This reduces the 
memory requirements even further. Fig. 3D shows a typical example of a watermark DCT 
block 302 corresponding to the spatial watermark block 202 in Fig. 2B. 

Fig. 3E shows a watermarked video DCT block 303 obtained by addition of 
watermark DCT block 302 to video DCT block 300. In this specific example, one of the non- 
zero coefficients (the one with the value -1 in Fig. 3 A) is turned into a zero coefficient, 
because the spatially corresponding watermark coefficient has the value +1. Fig. 3F shows 
the run-level pairs of the watermarked DCT block. Note that the former run-level pairs (1,-1) 
and (0,2) have been replaced by one run-level pair (2,2). Fig. 3G shows the corresponding 
output bit stream. The run merge operation appears to save one bit in this example. 

Fig. 2C shows the watermarked image represented by the output signal MPout 
of the arrangement. The pixel block denoted 203 in this Figure corresponds to the 
watermarked video DCT block 303 in Fig. 3E. As has been attempted to express in Fig. 2C, 
the amount of watermark embedding varies from tile to tile and from block to block. 

In the example described above, only the smallest coefficients (+1 and -1) are 
qualified for modification. This circumvents the need for drift compensation and renders the 
watennark imperceptible, in particular if the number of coefficients that is modified is bound 
to a given maximum (for example, 3). 

It is to be noted that the watermark coefficient values +1 and -1 in the 
embodiment described above may also be assigned to mean the direction (positive and 
negative, respectively) in which the corresponding image coefficient is to be modified. For 
example, it may be prescribed that a given range of negative DCT coefficients (for example, 
-2 and -1) are turned into zeroes by the watermark coefficient value +1, whereas a range of ' 
positive DCT coefficients (for example, +2 and +1) are turned into zeroes by watennark 
coefficient value -1 . 

It should further be noted that an MPEG2 elementary video stream may 
include field-coded DCT blocks and frame-coded DCT blocks. In accordance therewith, the 
watermark buffer 140 may be arranged to contain two watermark patterns, one for field- 
coded blocks and one for frame-coded blocks. The pattern being used for embedding the 
watennark is then selected by the field/frame selection identification signal accommodated in 
the input video stream. 

In the above described anangement for embedding a watermark in an MPEG 
encoded signal, the "level" part of run-level pairs is changed. However, a level is not an 
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value of an AC coefficient but a quantized Won .hereof For example, to run-leve. 
^0,0 "Fig. SB^yinfi^re^.acoemcientX^.Inanoti^b.oc,,^ 
pair 1-1) may represent a coefficient X-6, depending on the quantizer step s^Needless 
I say to. to effect of tinning an AC coefficient ftom - . 04 into 0 will generaUy have a 

coefficient from -6 into 0. 

There may thus be a need to control the watermark embedding process such 
ta ,to effect thereof on visibility is reduced. To mis end, afiutor embodiment of to 
embedding method includes the step of controlling to number and/or position, of 
coefficients being modified in dependence upon to quantizer step srze. 

In an MPEG decoder, inverse quantization is achieved by multiplymg to 
reived level x(n> with to quantizer step size. The quantizer step size is contiollcdhy . 
weighting matrix W(n) which modifies to step size within a block and a scale factor QS 
Jh modifies to step size ftom (macro-)block to (macro-,b.ocl. The foUowrng equation 
, specifies MPEO's arithmetic to reconstruo, an AC coefficient X(n) ftom the decoded level 

x(n): . ... . 

X(n) = x(n)xW(n)xQS 

where n denotes the index in order of the zigzag scan. 

There are various ways to generate an upper bound for the number of 

coefficients that are allowed to be modified. In one embodiment, a level x(n) may only be 

modified if the corresponding quantizing step size Q(n)=W(n)xQS is lessthana 

predetermined threshold. Different thresholds m ay thereby be used for different pos^ons m a 

DCT block (i.e. for different indexes n). 

In another embodiment, the maximum numberN of coefficients that are 

allowed to be modified in a block is a function of to quantizer scale factor QS such that N 
decreases as QS increase, The feasibility of fins embodiment can easily be understood ,f one 
realizes to. to scale facto in fact indicates how stiong a DCT block has been quantized. 
The larger to scale factor, i.e. to larger to quantization step size, to fewer coefficrents 
may he changed in order to render to effect imperceptible. An example of such a function is: 
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where c is a given constant value. 

The quantizer scale factor QS is accommodated in MPEG bit streams as a 
combination of a parameter quantizer _scale_code and a parameter q _scalejype. The 
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parameter quantizer _scale_code is a 5-bit code. The parameter qjtcalejype indicates 
whether saxd code represents a linear range of QS-values between 2 and 62, or an exponential 
range of values between 1 and 1 12. In both cases, the code is indicative for the step size 
Accordingly, the term QS in the above-mentioned function may also be replaced by the 
parameter quantizer - scale code. 

It is also advantageous to control the positions of the coefficients being 
modified by the watermark process in dependence upon the quantizer step size. The larger 
the quantizer step size, the later in the zigzag scan the desired modifications are carried out 
Tms leaves the low-frequency coefficients unaffected and restricts the visibility of the 
watermark embedding process to the higher frequency coefficients. 

The feature of controlling the maximum number and/or the positions of 
modifiable coefficients in dependence upon the quantizer step size requires only a minor 
modfficatton of the arrangement. Such a modification can easily be carried out by a skilled 
person and is therefore not shown. 

A method and arrangement are disclosed for embedding a watermark in an 
MPEG compressed video stream. The watermark (a spatial noise pattern) is embedded by 
selectively discarding the smallest quantized DCT coefficients. The discarded coefficients are 
subsequently merged in the runs of the remaining coefficients. The decision whether a 
coefficient is discarded or not is made on the basis of a pre-calculated watermark buffer and 
the number of already discarded coefficients per 8x8 DCT block. The advantages of this 
method are (i) a very simple bit rate control system and (ii) no need for drift compensation 
The algorithm can be implemented in a very efficient manner with respect to memory 
requirements and computational complexity. 
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CLAIMS: 



1 . A method of embedding a watermark in an information signal which is 

compressed so as to include first signal samples having a given first value and further signal 
samples having a different value, the method comprising the step of modifying signal 
samples in accordance with a watermark pattern, characterized in that said mo<iifying step is 
5 applied to signal samples if the modified signal sample assumes the first value due to said 
modification. 



2. The method as claimed in claim 1, wherein the first value is zero and the 
signal samples qualified for modification are signal samples having the smallest non-zero 

10 value. 

3. The method as claimed in claim 1, wherein the signal samples have been 
quantized with a quantizer step size, and the signal samples qualified for modification are 
signal samples being quantized with a step size which is less than a predetermined threshold. 

15 

4. The method as claimed in claim 1 , wherein the information signal is divided 
into sections and the number of signal samples qualified for modification is limited to a 
predetermined maximum per section. 

20 5. A method as claimed in claim 4, wherein the signal samples of a section have 

been quantized in accordance with a quantizer step scale, the method including the step of 
controlling said maximum of modified signal samples in dependence upon said quantizer step 
scale. 



25 6. A method as claimed in claim 1 , wherein the information signal is divided into 

sections and the signal samples of a section have been quantized in accordance with a 
quantizer step scale, the method including the step of controlling the positions of the signal 
samples qualified for modification within a section in dependence upon said quantizer step 
scale. 
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7. The method as claimed in any one of claims 1-6, wherein the compressed 

sxgnal mcludes variable-length code words each identifying a run of first signal samples and a 
subsequent or preceding further signal sample, the method further comprising the steps of 

- decoding the variable-length code words into respective first and further signal samples 
prior to said modifying step; 

- merging the modified signal sample with succeeding or preceding first signal samples to 
obtain a new run of first signal samples, and 

- encoding the new run of first signal samples and a subsequent or preceding further signal 
sample into a new variable-length code word. 

8- An arrangement for embedding a watermark in an information signal which is 

compressed so as to include first signal samples having a given first value and further signal 
samples having a different value, the arrangement comprising means for modifying signal 
samples m accordance with a watermark pattern, characterized in that the modifying means 
are arranged to modify signal samples if the modified signal sample assumes the first value 
due to said modification. 
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